Practicing Q-Learning

Authors

  • Jörg Bruske
  • Ingo Ahrns
  • Gerald Sommer
Abstract

Q-Learning has gained increasing attention as a promising real-time learning scheme for learning from delayed reinforcement. Being compact, model-free and theoretically optimal, it is commonly preferred to AHC-Learning and its derivatives. However, it has long been noticed that theoretical optimality has to be sacrificed in order to meet the constraints of most applications. In this article we report on experiments with modified Q-Learning algorithms and on the ingredients that proved key to their practical success in reinforcement learning. These include optimistic initialization, the principle of piecewise constancy of policy, and the use of activity traces. Finally, we extend these algorithms to growing RBF networks with additional on-line learning vector quantization (adaptive perceptualization) and obtain very encouraging results as well. Our test bed is pole balancing with additional noise on the sensory input.
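To make one of the ingredients named in the abstract concrete, the following is a minimal sketch of tabular Q-learning with optimistic initialization. It is not the authors' implementation: the chain environment, the function name q_learning_chain, and all parameter values (alpha, gamma, q_init, the step cap) are assumptions chosen for illustration, and the paper's actual test bed is pole balancing with noisy sensors. The sketch also omits activity traces and the piecewise-constant policy.

```python
import random


def q_learning_chain(n_states=5, episodes=500, alpha=0.5, gamma=0.9,
                     q_init=1.0, seed=0):
    """Tabular Q-learning on a hypothetical chain (an assumption, not the paper's task).

    States are 0..n_states-1, actions are 0 (left) and 1 (right).
    Entering the last state ends the episode with reward 1; all other
    transitions give reward 0.
    """
    rng = random.Random(seed)
    goal = n_states - 1
    # Optimistic initialization: every estimate starts at q_init, an upper
    # bound on the achievable return here, so untried actions look attractive.
    q = [[q_init, q_init] for _ in range(n_states)]
    for _ in range(episodes):
        s = 0
        for _ in range(100):  # step cap per episode (illustrative safeguard)
            # Purely greedy choice with random tie-breaking; exploration
            # comes from the optimistic estimates, not from random actions.
            best = max(q[s])
            a = rng.choice([i for i, v in enumerate(q[s]) if v == best])
            s_next = max(0, s - 1) if a == 0 else s + 1
            done = s_next == goal
            r = 1.0 if done else 0.0
            # One-step Q-learning target: r + gamma * max_a' Q(s', a').
            target = r if done else r + gamma * max(q[s_next])
            q[s][a] += alpha * (target - q[s][a])
            s = s_next
            if done:
                break
    return q
```

Calling q_learning_chain() returns the learned table; comparing q[s][1] with q[s][0] for each non-terminal state s shows which action the greedy policy prefers. Because the estimates start high and are only ever lowered toward their targets in this deterministic setting, an action that has been tried and found disappointing loses out to one that has not been tried yet, which is the intuition behind using optimistic initial values in place of explicit random exploration.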

Publication date: 1996